A missing data imputation method based on salp swarm algorithm for diabetes disease
نویسندگان
چکیده
Most of the medical datasets suffer from missing data, due to expense some tests or human faults while recording these tests. This issue affects performance machine learning models because values features will be missing. Therefore, there is a need for specific type methods imputing data. In this research, salp swarm algorithm (SSA) used generating and in pain my ass (also known Pima) Indian diabetes disease (PIDD) dataset, proposed called (ISSA). The obtained results showed that classification three different classifiers which are support vector (SVM), K-nearest neighbour (KNN), Naïve Bayesian classifier (NBC) have been enhanced as compared dataset before applying method. Moreover, indicated issa was performed better than statistical imputation techniques such deleting samples with values, replacing zeros, mean, random values.
منابع مشابه
Missing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملMissing Value Imputation Based on Data Clustering
We propose an efficient nonparametric missing value imputation method based on clustering, called CMI (Clustering-based Missing value Imputation), for dealing with missing values in target attributes. In our approach, we impute the missing values of an instance A with plausible values that are generated from the data in the instances which do not contain missing values and are most similar to t...
متن کاملKernel-Based Multi-Imputation for Missing Data
A Kernel-Based Nonparametric Multiple imputation method is proposed under MAR (Missing at Random) and MCAR (Missing Completely at Random) missing mechanisms in nonparametric regression settings. We experimentally evaluate our approach, and demonstrate that our imputation performs better than the well-known NORM algorithm.
متن کاملControl Stability Evaluation of Multiple Distribution Static Compensators based on Optimal Coefficients using Salp Swarm Algorithm
In order to solve the problem of voltage drop and voltage imbalance in the distribution systems, the injection of reactive power by multiple static compensators is used. The distributed generation such as photovoltaic systems could play a role of the static compensators by producing reactive power. In this paper, the integral to droop line algorithm is used to control the reactive power in busb...
متن کاملMultiple Imputation for Missing Data
Multiple imputation provides a useful strategy for dealing with data sets with missing values. Instead of filling in a single value for each missing value, Rubin’s (1987) multiple imputation procedure replaces each missing value with a set of plausible values that represent the uncertainty about the right value to impute. These multiply imputed data sets are then analyzed by using standard proc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bulletin of Electrical Engineering and Informatics
سال: 2023
ISSN: ['2302-9285']
DOI: https://doi.org/10.11591/eei.v12i3.4528